Calculs pour les matrices denses : coût de communication et stabilité numérique. (Dense matrix computations : communication cost and numerical stability)

نویسنده

  • Amal Khabou
چکیده

This dissertation focuses on a widely used linear algebra kernel to solve linear systems, that is the LU decomposition. Usually, to perform such a computation one uses the Gaussian elimination with partial pivoting (GEPP). The backward stability of GEPP depends on a quantity which is referred to as the growth factor, it is known that in general GEPP leads to modest element growth in practice. However its parallel version does not attain the communication lower bounds. Indeed the panel factorization represents a bottleneck in terms of communication. To overcome this communication bottleneck, Grigori et al [60] have developed a communication avoiding LU factorization (CALU), which is asymptotically optimal in terms of communication cost at the cost of some redundant computation. In theory, the upper bound of the growth factor is larger than that of Gaussian elimination with partial pivoting, however CALU is stable in practice. To improve the upper bound of the growth factor, we study a new pivoting strategy based on strong rank revealing QR factorization. Thus we develop a new block algorithm for the LU factorization. This algorithm has a smaller growth factor upper bound compared to Gaussian elimination with partial pivoting. The strong rank revealing pivoting is then combined with tournament pivoting strategy to produce a communication avoiding LU factorization that is more stable than CALU. For hierarchical systems, multiple levels of parallelism are available. However, none of the previously cited methods fully exploit these hierarchical systems. We propose and study two recursive algorithms based on the communication avoiding LU algorithm, which are more suitable for architectures with multiple levels of parallelism. For an accurate and realistic cost analysis of these hierarchical algorithms, we introduce a hierarchical parallel performance model that takes into account processor and network hierarchies. This analysis enables us to accurately predict the performance of the hierarchical LU factorization on an exascale platform.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Managing data persistence in network enabled servers

The GridRPC model [17] is an emerging standard promoted by the Global Grid Forum (GGF) that defines how to perform remote client-server computations on a distributed architecture. In this model data are sent back to the client at the end of every computation. This implies unnecessary communications when computed data are needed by an other server in further computations. Since, communication ti...

متن کامل

Numerical Analysis Level-Set method and stability condition for curvature-driven flows

We consider models for the simulation of curvature-driven incompressible bifluid flows, where the surface tension term is discretized explicitly. From this formulation a numerical stability condition arises for which we present a new theoretical estimation for low and medium Reynolds numbers. We illustrate our analysis with numerical simulations of microfluidic flows using Level Set method. Fin...

متن کامل

Level-Set method and stability condition for curvature-driven flows

We consider models for the simulation of curvature-driven incompressible bifluid flows, where the surface tension term is discretized explicitly. From this formulation a numerical stability condition arises for which we present a new theoretical estimation for low and medium Reynolds numbers. We illustrate our analysis with numerical simulations of microfluidic flows using Level Set method. Fin...

متن کامل

Using Random Butterfly Transformations to Avoid Pivoting in Sparse Direct Methods

We consider the solution of sparse linear systems using direct methods via LU factorization. Unless the matrix is positive definite, numerical pivoting is usually needed to ensure stability, which is costly to implement especially in the sparse case. The Random Butterfly Transformations (RBT) technique provides an alternative to pivoting and is easily parallelizable. The RBT transforms the orig...

متن کامل

A Theory of Complexity of Monadic Recursion Schemes

— Complexity of a monadic recursion scheme is defined through numerical characteristics of trees representing its computations. A class of such complexity characteristics of trees essentially unlike the computation time: so called mimeoinvariant complexity measures, is introduced which induce several dense hiérarchies of complexity classes of monadic recursion schemes of unbounded complexity an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013